Empirical and theoretical support for lenient learning

نویسندگان

Daan Bloembergen

Michael Kaisers

Karl Tuyls

چکیده

Recently, an evolutionary model of Lenient Q-learning (LQ) has been proposed, providing theoretical guarantees of convergence to the global optimum in cooperative multi-agent learning. However, experiments reveal discrepancies between the predicted dynamics of the evolutionary model and the actual learning behavior of the Lenient Q-learning algorithm, which undermines its theoretical foundation. Moreover it turns out that the predicted behavior of the model is more desirable than the observed behavior of the algorithm. We propose the variant Lenient Frequency Adjusted Qlearning (LFAQ) which inherits the theoretical guarantees and resolves this issue. The advantages of LFAQ are demonstrated by comparing the evolutionary dynamics of lenient vs non-lenient Frequency Adjusted Q-learning. In addition, we analyze the behavior, convergence properties and performance of these two learning algorithms empirically. The algorithms are evaluated in the Battle of the Sexes (BoS) and the Stag Hunt (SH), while compensating for intrinsic learning speed differences. Significant deviations arise from the introduction of leniency, leading to profound performance gains in coordination games against both lenient and non-lenient learners.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Establishing an Argument-Based Validity Approach for a Low-Stake Test of Collocational Behavior

Most of the validation studies conducted across varying test application contexts are usually framed within the traditional conceptualization of validity and therefore lack a comprehensive framework to focus on test score interpretations and test score use. This study aimed at developing and validating a collocational behavior test (CBT), drawing on Kane's argument-based approach to validity. F...

متن کامل

Cloud Computing; A New Approach to Learning and Learning

Introduction: The cloud computing and services, as a technological solution for developing educational services, can accelerate the provision and expansion of these highly useful services. This study intended to provide an overall picture of practical areas of learning services based on cloud computing teaching and learning equipment. Methods: This was a theoretical hybrid research study in whi...

متن کامل

The Effect of Four Different Types of Involvement Indices on Vocabulary Learning and Retention of EFL Learners

The purpose of the present study was to provide empirical support for the construct of the involvement load hypothesis (ILH ) in an EFL context. To fulfill the purpose of the study, 4 intact groups consisting of 126 intermediate-level students participated in this experiment. In order to ensure that the participants were at the same level of English language proficiency, the Nelson test was adm...

متن کامل

Midlife crisis: a debate.

Without doubt, the midlife crisis is the most popular concept describing middle adulthood. Facing the limitation of the time until death, men in particular are believed to pause from actively pursuing their goals and review their achievements, take stock of what they have and have not yet accomplished, at times taking drastic measures to fulfill their dreams. This paper critically discusses the...

متن کامل

Reinforcement Learning in Multi-agent Games

This article investigates the performance of independent reinforcement learners in multiagent games. Convergence to Nash equilibria and parameter settings for desired learning behavior are discussed for Q-learning, Frequency Maximum Q value (FMQ) learning and lenient Q-learning. FMQ and lenient Q-learning are shown to outperform regular Q-learning significantly in the context of coordination ga...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2011

Empirical and theoretical support for lenient learning

نویسندگان

چکیده

منابع مشابه

Establishing an Argument-Based Validity Approach for a Low-Stake Test of Collocational Behavior

Cloud Computing; A New Approach to Learning and Learning

The Effect of Four Different Types of Involvement Indices on Vocabulary Learning and Retention of EFL Learners

Midlife crisis: a debate.

Reinforcement Learning in Multi-agent Games

عنوان ژورنال:

اشتراک گذاری